Curved Text Detection


Curved text detection is the process of identifying and localizing text that is curved or non-linear in images.

Curved Worlds, Clear Boundaries: Generalizing Speech Deepfake Detection using Hyperbolic and Spherical Geometry Spaces

Add code
Nov 13, 2025
Viaarxiv icon

Semantic Document Derendering: SVG Reconstruction via Vision-Language Modeling

Add code
Nov 17, 2025
Figure 1 for Semantic Document Derendering: SVG Reconstruction via Vision-Language Modeling
Figure 2 for Semantic Document Derendering: SVG Reconstruction via Vision-Language Modeling
Figure 3 for Semantic Document Derendering: SVG Reconstruction via Vision-Language Modeling
Figure 4 for Semantic Document Derendering: SVG Reconstruction via Vision-Language Modeling
Viaarxiv icon

A Large-scale Dataset for Robust Complex Anime Scene Text Detection

Add code
Oct 09, 2025
Viaarxiv icon

ViTs: Teaching Machines to See Time Series Anomalies Like Human Experts

Add code
Oct 06, 2025
Figure 1 for ViTs: Teaching Machines to See Time Series Anomalies Like Human Experts
Figure 2 for ViTs: Teaching Machines to See Time Series Anomalies Like Human Experts
Figure 3 for ViTs: Teaching Machines to See Time Series Anomalies Like Human Experts
Figure 4 for ViTs: Teaching Machines to See Time Series Anomalies Like Human Experts
Viaarxiv icon

SAViL-Det: Semantic-Aware Vision-Language Model for Multi-Script Text Detection

Add code
Jul 27, 2025
Viaarxiv icon

LLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword Spotting

Add code
May 29, 2025
Figure 1 for LLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword Spotting
Figure 2 for LLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword Spotting
Figure 3 for LLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword Spotting
Figure 4 for LLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword Spotting
Viaarxiv icon

Generalized Visual Relation Detection with Diffusion Models

Add code
Apr 16, 2025
Viaarxiv icon

Edge Approximation Text Detector

Add code
Apr 05, 2025
Figure 1 for Edge Approximation Text Detector
Figure 2 for Edge Approximation Text Detector
Figure 3 for Edge Approximation Text Detector
Figure 4 for Edge Approximation Text Detector
Viaarxiv icon

Advancing Chronic Tuberculosis Diagnostics Using Vision-Language Models: A Multi modal Framework for Precision Analysis

Add code
Mar 17, 2025
Figure 1 for Advancing Chronic Tuberculosis Diagnostics Using Vision-Language Models: A Multi modal Framework for Precision Analysis
Figure 2 for Advancing Chronic Tuberculosis Diagnostics Using Vision-Language Models: A Multi modal Framework for Precision Analysis
Figure 3 for Advancing Chronic Tuberculosis Diagnostics Using Vision-Language Models: A Multi modal Framework for Precision Analysis
Figure 4 for Advancing Chronic Tuberculosis Diagnostics Using Vision-Language Models: A Multi modal Framework for Precision Analysis
Viaarxiv icon

OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment

Add code
Mar 03, 2025
Figure 1 for OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment
Figure 2 for OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment
Figure 3 for OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment
Figure 4 for OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment
Viaarxiv icon